Model selection for prognostic time-to-event gene signature discovery with applications in early breast cancer data.
Abstract
Model selection between competing models is a key consideration in the discovery of prognostic multigene signatures. Using appropriate statistical performance measures and verifying the biological significance of the signatures are both imperative to maximise the chance that the generated signatures validate externally. Current approaches in time-to-event studies often use only a single measure of performance in model selection, such as logrank test p-values, or dichotomise the follow-up times at some phase of the study to facilitate signature discovery. In this study we improve the prognostic signature discovery process by applying the multivariate partial Cox model together with four measures of signature performance: the concordance index, the hazard ratio of predictions, independence from available clinical covariates, and biological enrichment. The proposed framework was applied to discover prognostic multigene signatures from early breast cancer data. The partial Cox model combined with the multiple performance measures was used both to guide the selection of the optimal panel of prognostic genes and to predict risk within cross-validation, without dichotomising the follow-up times at any stage. The signatures were successfully externally cross-validated in independent breast cancer datasets, yielding a hazard ratio of 2.55 [1.44, 4.51] for the top-ranking signature.
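As an illustration of one of the performance measures named in the abstract, the sketch below computes a concordance index on censored follow-up times without dichotomising them. It assumes Harrell's standard pairwise definition; the abstract does not say which variant the authors used, and the function name, argument names, and toy values are hypothetical.

```python
from typing import Sequence


def concordance_index(times: Sequence[float],
                      events: Sequence[int],
                      risk_scores: Sequence[float]) -> float:
    """Harrell's C for right-censored data (assumed definition, see lead-in).

    A pair (i, j) is comparable when subject i had an observed event and a
    strictly shorter follow-up time than subject j. The pair is concordant
    when subject i also has the higher predicted risk; tied risk scores
    count as 0.5. Pairs with tied follow-up times are skipped.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored subject cannot be the earlier member of a pair
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs in the data")
    return concordant / comparable


# Toy data, invented for illustration only (not taken from the study).
toy_times = [2.0, 4.0, 6.0, 8.0]   # follow-up time per patient
toy_events = [1, 1, 0, 1]          # 1 = event observed, 0 = censored
toy_risk = [0.9, 0.7, 0.4, 0.1]    # higher score = higher predicted risk
toy_c = concordance_index(toy_times, toy_events, toy_risk)
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking of patients by risk, which is why the measure can be used for model selection directly on the censored follow-up times.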
Similar resources
Identification of Prognostic Genes in Her2-enriched Breast Cancer by Gene Co-Expression Network Analysis
Introduction: HER2-enriched subtype of breast cancer has a worse prognosis than luminal subtypes. Recently, the discovery of targeted therapies in other groups of breast cancer has increased patient survival. The aim of this study was to identify genes that affect the overall survival of this group of patients based on a systems biology approach. Methods: Gene expression data and clinical infor...
Comparison of Bayesian Parametric Models in the Analysis of Factors Affecting the Survival of Patients with Gastric Cancer
Background & Objectives: Cox proportional-hazards regression and other parametric models have achieved widespread use in the analysis of time-to-event data with censoring and covariates. However, the Bayesian approach has not been widely used or discussed. The aim of this study was to evaluate the prognostic factors using Bayesian interval-censoring analysis. Methods: This cohort...
Gene-Gene Interaction Study Between Genetic Polymorphisms of Folate Metabolism and MTR SNPs on Prognostic Features Impact for Breast Cancer
Background: Breast cancer (BC) is the second leading cause of cancer mortality after lung cancer and varies across the world owing to genetic and environmental factors. In this study, we evaluated the interaction between polymorphisms in genes encoding enzymes of folate metabolism, methylenetetrahydrofolate reductase (MTHFR) and methionine synthesis reductase (MTR), and the BC prognostic factors. ...
HER4 Gene Expression in Paraffin-Embedded Block Samples from Patients with Breast Cancer
Background: Breast cancer is the second cause of death worldwide. Understanding the molecular pathology of breast cancer can provide useful information about new treatment routes. The HER4 gene has recently been considered a molecular pre-prognostic marker in cancers. The purpose of this research is therefore to study HER4 gene expression in breast cancer patients. Methods: In this study, 70 samples of ...
Diagnosis of Breast Cancer Subtypes using the Selection of Effective Genes from Microarray Data
Introduction: Early diagnosis of breast cancer and the identification of effective genes are important issues in the treatment and survival of the patients. Gene expression data obtained using DNA microarray in combination with machine learning algorithms can provide new and intelligent methods for diagnosis of breast cancer. Methods: Data on the expression of 9216 genes from 84 patients across...
Journal title:
- Statistical applications in genetics and molecular biology
Volume 12, Issue 5
Pages -
Publication date 2013